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Abstract. The Reverse Stein Effect is identified and illustrated: A sta- 
tistician who shrinks his/her data toward a point chosen without re- 
liable knowledge about the underlying value of the parameter to be 
estimated but based instead upon the observed data will not be pro- 
tected by the minimax property of shrinkage estimators such as that 
of James and Stein, but instead will likely incur a greater error than if 
shrinkage were not used. 

Key words and phrases: James-Stein estimator, shrinkage estimator, 
Bayes and empirical Bayes estimators, multivariate normal distribu- 
tion. 
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1. THE CASE FOR SHRINKAGE: THE STEIN 
EFFECT 

Suppose that X is an observed random vector in 
p-dimensional Euclidean space MP such that X = 
Y + 6, where 5 is an unknown location parameter 
and Y is an unobserved absolutely continuous ran- 
dom vector. Under the mild assumption that Y = 
X — 6 is directionally symmetric,^ it is easy to heuris- 
tically justify "shrinkage" estimators for 6 of the 
form 



(1) 6, 



6j{X;6o) 



:-f{X-6o)-iX-6o)+6o 



where 7 = ^{X — 5o) E [0,1) and 60 is any fixed 
shrinkage target point in W. The improvement of- 
fered by such shrinkage estimators is often referred 
to as the Stein Effect. 
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^Y — —Y, where Y :— Y/\\Y\\ is the unit vector in the di- 
rection of Y (see Appendix A). 



First, for fixed 5 and 60, let Bi = Si(||(5 — (5o||; (^0) C 
R^ denote the ball of radius ||(5 — (5o|| centered at 5q 
and let H be the halfspace bounded by a hyper- 
plane dH tangent to Bi at 6 (see Figure 1). Then 

(2) {X\\\X-6o\\>\\d-5o\\} = B'i, 

Frs[\\X-6o\\>\\6-5o\m 

= Pvs[X e Blldo] 

>Pvs[XeH\6o] 
_ 1 

2' 

where (3) follows from directional symmetry by Pro- 
position 1(c) in Appendix A. Furthermore, under 
somewhat stronger but still general assumptions (see 
Proposition 2 in Appendix A), 



(3) 



(4) 



limPr5[||X-5o||>||<^-')^o||] 
= lim Pvs[X £B^] = 1. 

p— >oo 



Thus, ||X — ^oll is usually an overestimate of \\d — 
6o\\, so an estimator of the form ^{X — 5o) ■ {X — 60) 
for 6 — 6q should be preferable to X — 60 itself. Writ- 
ing (5 as {S — 6o) + 6o immediately leads to estimators 
for 5 of the form (1). 
Second (see Appendix B), 



(5) {X\3^€[0,l)B\\6^-6\\<\\X-6\\} 



B2, 



where 7 = ^{X — 5o,S — 60) is allowed to depend 
on 6 and B2 = B2{\\S — 6o\\; 6) is the ball of radius 
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Fig. 1. The balls Bi and B2 in (2) and (5). 



^\\6 — 6o\\ centered at ^{Sq + 5) = 6. Since B2 D Bf, 
also 

(6) Pts[X G 5| I 5o] > i 

and, under the assumptions of Proposition 2 in Ap- 
pendix A, 



(7) 



lim Frs[XeB^] = l. 



This shows that if 6 were known, then usually some 
shrinkage factor 7 applied to X — Sq will move X 
closer to 6, again suggesting a search for estimators 
of the form (1). 

2. THE STEIN PARADOX 

Assume now that Y '^ Np{0,a'^lp), the multivari- 
ate normal distribution with mean and covari- 
ance matrix cr^lp, where cr^ > is known, so X ~ 
Np{6,a'^lp). In this simple case, the James-Stein 
(JS) estimator for 6 is given by 

SjS^Sjs{X;6o) 



(8) 



1 



\X 



SoP 



{X-6o) + do, 



where 5o is a fixed but arbitrary point in 
truncated = "plus-rule" JS estimator 



The 



(9) 



-'JS 



6is{X;So) 



\\X-So\\'- 



{X-6o) + So 



is a shrinkage estimator of the form (1). These re- 
nowned estimators have the property that when 
p>3, they dominate X under both the mean square 
error (MSE) and Pitman closeness (PC) criteria:^ 
for every fixed 6, 60 £W, 



(10) 

(11) 
(12) 

(13) 
(14) 



Es[\\6+^{X;So)-6f\6o] 

<Es[\\SjsiX;5o)-5\\''\6o] 
<Es[\\X-6f]^pa', 

PTs[\\5is{X;5o)-5\\<\\X-6\\\So] 
>Pr5[||<5js(X;<5o)-(^||<||X-<5 



Pr 



Xp 



\5-6nf\ . \\d-6of 



4f72 



> 



4C72 



+ 



> 



and approaches 1 as p — )■ 00 if 



P-<5oll 



o{p) (apply 



Chebyshev's inequality), where Xpi'n) denotes a non- 
central chi-square random variate with p degrees of 
freedom and noncentrality parameter rj. Note espe- 
cially that: 



(A) the improvements offered by the JS estimators 
can be great, especially when p is large: ii6 = 60, 
then MSE(5|s) < MSE((5js) = 2a^ < pa^, and 
if \\5 — SqW = o{p) with o"^ fixed, then Pr5[||5 — 
5\\ < \\X — 6\\] — )• 1 as p — )• 00 for both 6 = Sjs 
and ^jg; 

(B) the MSE and PC dominances of X by 5js and 
6jg hold even if the true mean 6 is arbitrarily 
far from the shrinkage target Sq. 

Of the two properties (A) and (B), it is (B) that is 
most surprising, since it is not difficult to construct 
estimators that satisfy (A), for example, a Bayes es- 
timator w.r. to a normal prior centered at Sq. How- 
ever, such a Bayes estimator will not satisfy (B), the 
difference stemming from the fact that the Bayes es- 
timator will have a constant shrinkage factor, while 
the shrinkage factors in (8) and (9) are adaptive.^ 



^See Baranchik (1964) or Efron and Morris (1973) for (10), 
James and Stein (1961), Efron and Morris (1973), Arnold 
(1981), Anderson (1984), Berger (1985), or Lehmann and 
Casella (1998) for (11), our Appendix C for (12), and Efron 
(1975) or Sen, Kubokawa and Saleh (1989) for (13). In Efron's 
equation (2.11), page 265, the second inequality should be re- 
versed. 

^In fact, the JS estimator can be derived via an empiri- 
cal Bayes argument based on such priors — see Stein [(1966), 



REVERSING THE STEIN EFFECT 



When first discovered, the domination of X by 
the JS estimators was highly surprising, because the 
estimator X itself is as follows:^ 

(a) the best unbiased estimator of 5, 

(b) the best translation-invariant estimator of 5, 

(c) the maximum likelihood estimator (MLE) of 5, 

(d) a minimax estimator of 5, and 

(e) an admissible estimator of 5 when p = 1 or 2. 

So compelling were these properties of X that its 
domination by the JS estimators came to be known 
as the Stein Paradox.^ 

3. LOST IN SPACE: THE REVERSE STEIN 
EFFECT 

Star Trek, Stardate 4598.0: The Federation Star- 
ship U.S.S. Enterprise, about to rendezvous with in- 
terstellar space station Delta, was struck by a mys- 
terious distortion of the space-time continuum that 
disrupted all its power systems, including naviga- 
tion, communications, and computers. Out of con- 
trol, the Enterprise careened wildly and randomly 
through interstellar space at maximum warp for 
three days until, equally mysteriously, its warp drive 
went off-line and the ship came to a full stop. Cap- 
tain Kirk knew that, without power and commu- 
nication, their only hope for rescue was to launch 
a probe that would come close enough to Delta to 
be detected and convey their present location. 

By means of stellar charts, Lieutenant Ohura de- 
termined the present location X of the Enterprise, 
but because all computer records had been lost, the 
location 6 of station Delta was unknown. Mr. Chekov, 
fresh out of the Space Academy where he studied 
multivariate statistical analysis under Admiral Emer- 
itus Stein, immediately suggested a solution: 

"We can utilize the Stein Effect! Because the En- 
terprise essentially followed a random walk while out 
of control we know that X ~ N^{6, cr^ls), while from 
the duration of the disruption and the character- 
istics of our warp engines we know that a = 2400 
light-years. If we use the truncated James-Stein esti- 
mator (5J^(X; (5o) with p = 3 to estimate 5 by shrink- 
ing X toward a fixed point Sq, then by (11) and (14), 
5jg{X;6o) is more likely to be closer to Delta than 
our present location X is, no matter where Delta is! 



And what's more, we can shrink X toward any Sq 
that we like!" 

"Amazing!" Kirk said. "Now I wish I had paid 
more attention in my stats class," (smiling to him- 
self: but that's not how one makes Admiral!) "But 
what about (5o? To what shrinkage target point 
should we actually send our probe?" 

"Why, toward Earth, of course," Scotty^ said in 
his thick Scottish brogue. "The Scotch there is the 
best in the galaxy." 

"No, toward Qo'noS^" Lt. Worf^ exclaimed. "Per- 
haps they will send us some fresh qagh^ — I am so 
tired of this replicated stuff." 

"Permit me to suggest Denobula," Dr. Phlox^'' 
offered. "Tomorrow is the tenth wedding anniver- 
sary of my third wife and her fourth husband — 
perhaps the probe might convey my congratulations 
to them." 

Suggestions for the shrinkage target point Jq were 
soon received from every member of the 400-person 
crew, all except Mr. Spock. After several minutes he 
raised his left eyebrow and said "This is not logical. 
Please accompany me to the holodeck.^^" 

When the officers were assembled on the holodeck, 
Spock commanded: "Computer, ^^ construct a three- 
dimensional star chart showing the distribution in 
the galaxy of the homeworlds 6q of our crew mem- 
bers. What if any statistical properties does this dis- 
tribution possess?" 

"The dis-tri-bu-tion of home- worlds is such that Sq 
is di-rec-tion-al-ly sym-me-tric a-bout our pre-sent 
lo-ca-tion X ," the computer intoned monotonically. 

"Computer, display the following set: 

(15) {6o\3^G[0,l)3\\5^-5\\<\\X-6\\}, 

where 7 = ^{X — 6o,X — S) may depend on 6." 

"This set is ex-act-ly H'^, the com-ple-ment of the 
closed half-space H in Figure 2 on my mon-i-tor." 



page 356], Efron and Morris [(1973), pages 117-118], Arnold 
[(1981), Section 11.4]. 

''Cf. Berger (1985), Lehmann and Casella (1998). 

^Cf. Efron and Morris (1977). 



''A.k.a. James Doohan, who, during the writing of this pa- 
per, beamed out of this universe on July 20, 2005, the 36th 
anniversary of the first human landing on an extraterrestrial 
body. 

^The Capitol of the Klingon Empire. 

*Yes, we know, Worf didn't appear until Star Trek: The 
Next Generation — some slack, please. 

^A Klingon dish of serpent worms, best when served live. 

^"Okay, he appeared a century earlier on Star Trek: Enter- 
prise — more slack please. 

^^And still more slack. 

^^Ok, let's suppose that the computer power has been re- 
stored, but only momentarily. 
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Pr[Sn € H" | X] = i 




Fig. 2. The complement of H is the set (15). 

"Then, since Pr[(5o G Z/'^IX] = 2 by directional 
symmetry, this shows that shrinkage toward a ran- 
domly chosen 5q would have at most a 50-50 chance 
of moving X closer to 5 even when the shrinkage 
factor is chosen optimally for 5." 

"As for James-Stein shrinkage," Spock continued, 
"Computer, for representative values of 5, display 
the set of all 6q such that the James-Stein shrinkage 
estimator (5J^(X;5o) lies closer to 5 than does our 
present location X." 

"The two re-pres-ent-a-tive ca-ses are now dis- 
played in Fig-ures 3a and 3b^'^ on my mon-i-tor." 

"Thank you. Computer. It is apparent from these 
two displays," Spock said to the assembled officers, 
"that the set^^ of 5q such that James-Stein shrink- 
age toward Sq does more harm than good is quite 
extensive. Furthermore, since Mr. Chekov assures 
us that this choice can be made arbitrarily, in the 
interest of fairness, we may as well choose 5q at 
random from our crew members' homeworlds. But 
then, contrary to Mr. Chekov's assertion, (5Jg(X;(5o) 
is less likely to be closer to 5 than is our present 
location X." 

"More precisely, by the directional symmetry of 5q 
about X, it follows from Figures 3a and 3b that 



(16) 



Vt[\\5%{X-5o)-5\\>\\X-5\\\X] 
>Yi[5q(^H\X] = \. 



'^^See Appendix D for their derivation. 
^*This set is the complement of the cross-hatched region in 
Figure 3a or 3b 




(a) ||X-<5||<fVP^^ 
ai|Jf-iJ .. 

1-i 




(18) 



Fig. 3. The cross-hatched region is the set {5o\\\5'\^{X;Sq) — 
5\\<\\X-5\\}- 

If (5o is actually symmetrically distributed about X, 
then it is easy to see that 

(17) n^j^{X-5o)\X] = X, 

so by Jensen's inequality, 

mux-M-5f\x]>n\x-5f\x] 

Furthermore, under additional but still general as- 
sumptions,"^^ 

(19) lim Pr,[||5+ (X; ^o) - <5|| > \\X - 6\\] = 1. 

Thus, it is likely that James-Stein shrinkage will 
actually move us farther away from 6. I conclude, 
therefore, that we should simply tether the probe to 
the Enterprise and hope that Delta can detect our 
present location X." 



^See Proposition 3 in Appendix D. 
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"Boy, Spock, you are a party pooper," Bones^^ 
said. "I sure hope we don't shrink toward Vulcan." 
"Resistance is futile," said Seven-of-Nine.^''' 
"But, but, — I don't understand this," Chekov 
stammered. "How can the James-Stein estimator be 
inferior to X after all? Don't (16) and (18) contra- 
dict (14) and (11)? For example, under any proba- 
bility distribution for (5o, (11) yields 



(20) 



M¥Ux-M)-5\?]<M\\x-6r] 



= pa^ V(5e 



while (18) yields 



(21) 



Es[\\5J,{X;5o)-6f]>Es[\\X-6f] 



I am so confused!" 

"Beam me to the bar, Scotty," Kirk finally mum- 
bled. "Maybe I can figure this out after I belt down 
a few." 

4. TO SHRINK OR NOT TO SHRINK— THAT 
IS THE QUESTION 

Mr. Spock quickly assured Mr. Chekov that no 
formal contradiction had occurred: the probabili- 
ties and expectations appearing in (11), (14), (16), 
and (18) are conditional probabilities and condi- 
tional expectations with different conditioning vari- 
ables. Furthermore, the joint distributions of {X,5o) 
in (20) and (21) are different, having joint pdfs of the 
forms fsiX)f{5o) and fsiX)f{6o\X), respectively. 
In the former, X and Sq are independent, whereas 
in the latter, 6o is dependent on X. 

However, Captain Kirk's dilemma^^ remains: to 
shrink or not to shrink? If, according to property (B), 
the shrinkage target 5q can be chosen arbitrarily and 
still reduce the MSE and PC, can choosing 6o at ran- 
dom in some symmetric manner actually increase 
the MSE and PC? 



^•^Dr. McCoy. 

'^'' Right again, but how could we leave her out? 

^^Captain Kirk is "exactly in the position of Buridan's ass," 
as described in Barnard's discussion of the noninvariant na- 
ture of the James-Stein estimator in Stein [(1962), page 288]. 
The ass, when presented with two bales of hay, equidistant to 
his right and left, refused to move, seeing no reason to prefer 
one direction over the other. Like Barnard, we maintain that, 
in the absence of additional influences, such as prior infor- 
mation about the delectability of dextral vs. sinistral hay (or 
a loss function reflecting a negative effect of starvation), the 
ass's refusal to budge was correct. 



The short answer is yes, the Reverse Stein Effect 
is just as real as the original Stein Effect itself — both 
are simply manifestations of the strong curvature of 
spheres in multi-dimensional Euclidean space. Fig- 
ures 3a, 3b, and the results (16), (18), and (19) 
show that, without some prior knowledge of the lo- 
cation 5, Captain Kirk should not shrink X . If the 
shrinkage target 6q is chosen without reliable prior 
information but instead is based upon the data X, 
the minimax/Bayesian robustness property (B) of 
the JS estimator is lost and no longer guarantees 
that shrinking is not harmful on average. 

The implications for statistical practice are appar- 
ent. A shrinkage estimator is only as good as, but no 
better than, the prior information upon which it is 
based. Without reliable prior, as opposed to pos- 
terior,^'^ information, shrinkage is likely to decrease 
the accuracy of estimation. As Barnard^*^ concluded, 
if the statistical estimation problem is truly invari- 
ant under translation, then the best invariant esti- 
mator should be used, namely, X itself. 

APPENDIX A: DIRECTIONAL AND 

SPHERICAL SYMMETRY; 

VERIFICATION OF (4) 

Definition 1. y G M^ is directionally symmet- 
ric if y = —Y, where Y := TryTi is the unit vector 
in the direction of 1". Y is directionally symmetric 
about yo if y — yo is directionally symmetric. 

Clearly Y is directionally symmetric if Y is sym- 
metric: Y = —Y. Thus, any multivariate normal or 
elliptically contoured random vector Y centered at 
is directionally symmetric. Directional symmetry is 
much weaker than symmetry, as seen from the fol- 
lowing result. 

Proposition 1. Let Y be an absolutely contin- 
uous random vector in W . The following are equiv- 
alent: 

(a) Y is directionally symmetric. 

(b) Pr[y G C] = Pr[— y G C] for every closed con- 
vex cone C C M?'. 

(c) Pr[y G -ff] = I for every central (i.e., G dH) 
half space H<ZW. 

Proof. The implications (a) <^ (b) =^ (c) are 
straightforward. We will show that (c) =^ (a). Let P 



^^As represented, for example, by "data-dependent" priors. 
20Cf. Stein [(1962), page 
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(resp., Q) denote the probability distribution of Y 

(resp., Y). First note that since P[dH] = 0, P[H] = 
I is equivalent to 

(22) P[H] = PIH"] = P[-H] . 

Thus, for any two central halfspaces H and Hq, 

P[H n Ho] - P[H^ n H^] 
= P[H]-P[H^] 
= P[H']-P[Ho] 
= P[H'nH^]-P[HnHo], 
hence, 



(23) 



P[H n Ho] = P[H^ n H^] 

= P[i-H)n{-Ho)]. 
It follows from Lemma 1 below that 

(24) Q[AnSo] = Q[i-A)ni-So)] 

for every Borel set A '^S^ (the unit sphere in 
where Sq = HoCiSp. Thus, 

Q[A] = Q[AnSo] + Q[An{-So)] 

(25) =Q[(-A)ni-So)] + Q[i-A)niSo)] 

= Q[-A] 

for every such A, hence, (a) holds. D 

Lemma 1. Let Y be an absolutely continuous 
random vector in MP and let P and Q be as de- 
fined above. Suppose that Hq GW is a central half- 
space such that P[Ho] = ^, so also Q[So] = \ where 
So = HonSP.If 

(26) Q[S\So] = Q[-S\-So] 
for every hemisphere S C S^ , then 

(27) Q[A\So] = Q[-A\-So] 

for every Borel set A C 5^, which is equivalent to (24) 
because Q[±So] = P[±//o] = ^- Since every hemi- 
sphere S has the form H OS^ for some central half- 
space H , (26) is equivalent to 

(28) P[H I Ho] = P[-H 

for every central halfspace H C 1 
equivalent to (23). 

Proof. Without loss of generality, set Ho 

{{yi,---,yp~i,yp) |yp>o} so 

(29) So = < {yi, . . . ,yp-i,yp)^^yf = l,yp> 



Ho] 
which in turn is 



and let vr denote the stereographic projection^^ of Sq 
onto its tangent hyperplane Lq = {(yi, . . . , yp~i, 1)}. 
Then the relation 

(30) 7r{Sr\So)=K 

determines a bijection between the sets of all hemi- 
spheres S gS^ and all (not necessarily central) half- 
spaces K C Lo- 

Let Q denote the probability measure on S^ given 
by 

(31) Q[A]=Q[-A\-So], 
so (26) states that 

(32) Q[S\So] = Q[S] 

for every Borel set A Q S^. Let R and R denote 
the probability measures induced on Lq by Q[- | 5*0] 
and Q, respectively, under the mapping vr, that is, 

(33) R[B] = Q[tt-\B)\So], 

(34) R[B] = Q[7T-\B)] 

for every Borel set B C Lo. Then for each halfspace 
KcLo, 

R[K] = Qin-^K) I So] = Q[S D So \ So] 

^Q[S\So], 

R[K] = Q[7r-\K)] 
(36) 

= Q[SnSo] = Q[-{SnSo)\-So] 

(37) =Q[-S\-So]^Q[S], 

hence, R[K] = R[K] MK by (32). Thus by the Cra- 
mer-Wold device [cf. Bilhngsley (1979), page 334], 
R[B]=R[B] \/B, hence, setting A = tt~^ (B) in (33) 
and (34), Q[A \ Sq] = Q[A] \fA, which establi- 
shes (27). D 

Definition 2. y G RP is spherically symmet- 
ric = orthogonally invariant if y = TY for every 
orthogonal transformation L of R^. y is spherically 
symmetric about yo HY — yo is spherically symmet- 
ric. 

For example, y ~ iVp(0,(T^lp) is spherically sym- 
metric. Clearly spherical symmetry implies symme- 
try. It is well known that Y is spherically symmetric 
iff y is uniformly distributed on the unit sphere Sp 
and is independent of ||y||. We now use this fact to 
verify (4) by the following proposition, where 6, Sq, 
X , y, 0", ip, and r all depend on p. 



2^Cf. Ambartzumian [(1982), page 26], Watson [(1983), 
page 23]. 



Proposition 2. Assume that: 

(i) 60 is (fixed or) random is independent of X; 
(ii) Y = X — 5 is spherically symmetric; 
(iii) " »Y \ \ = L^"^ = o{p^''^) in probability as 
p — ;• 00 . Then [cf. (4)] 

limPTs[\\X-6o\\>\\6-So\\] 

(38) 

= lim Fis[X eB^] = l. 

The boundedness assumption (iii) is satisfied, for ex- 
ample, if X ^ Xp{6, cr^lp) and 60 ~ Np{ip, r^lp) with 
Wijj - 5\\/a = o{p) and T/a = o{p^/'^). 

Proof. Let /^p denote the uniform probability 
measure on Sp. By (i) and (ii), Pr5[X G Bi \ 5q\ de- 
pends on (5o only via \\5q — 5\\ (the radius of Bi), 
and 

(39) VTs[X&Bi\\\5o-m 

= Vi[Y(^Bi-5\\\5^-5\\] 

E{Pr[yG||y|ri(Si-<5)|||y||,||<^o-<5||]| 

= ni^p{\\Yr\B,-5))\\\5,-5\\}. 

Because Si — (5 is a ball with € d{Bi — 5), the set 
(||y||~^(i?i — 5))n5p is a spherical cap on Sp which, 
after some geometry, can be expressed as 

(41) {{zi,...,zp) '' "^ 
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hence. 



(40) 



> 



when ||y|| <2||(^o~'^||) and is empty otherwise. Fur- 
thermore, ^p can be represented as the distribution 
of Z , where Z = {Zi, . . . , Zp) ~ Np{0, Ip). Therefore, 

fip{\\Y\\-\B,-S)) 



(42) 



-Pr 

2 



Uf 


z! 

H 


+ z^ 


||y| 

-2\\5o- 
\\Y\\,\\6o 


2 

-d\\ 



when ||y|| < 2\\6o — 6\\, and = otherwise. Thus, 
by (40), 

Fvs[X e Bi\\\6o - 6\\] 



(43) 



< -Pr 

- 2 



zf 



ZI + --- + ZJ 



> 



\Y\ 



2\\6o-6\\^ 
\\^o-S\\ 



(44) 



PvsiXGB,] 



< -Pr 
- 2 



z- 



Zt + ' 



+ zi 



> 



\Y\ 



2\\5o-6\\'^ 



But Zf -\ \- Zp=0{p) in probability by the Law 

of Large Numbers, so by (iii), the right-hand side 
of (44) approaches as p ^ 00, which yields (38). 
D 

APPENDIX B: VERIFICATION OF (5) 

If we set /i(7) = \\6^{X- 60) - d\\^ and 6 = U^o + 5), 
then 



(45) 



h'il)=2iX-5oYiX-6) 

= 2[\\X-Sf-\\S-m- 

Since the right-hand side of (5) is the set {X \ h'{l) > 
0}, (5) follows. 

APPENDIX C: VERIFICATION OF (12) 

First note that 6jsiX;5o) ^ 5Jg{X;5o) iff ||X - 
6o\\ < o^p — 2 (the ball B\ of radius o^p — 2 cen- 
tered at 5o; see Figures 4a, 4b), in which case <^^g(-'^; 
(^o) = (^o- Define 

(46) C:={X\\\l]^{X-M)-^\\<\\X-m. 

(47) D := {X\\\6jsiX; 60) - 6\\ < \\X - 6\\}, 
so (12) is equivalent to 

(48) Pr[C|(^o]>Pr[I)|5o]. 
Since C\Bi = D\Bi, this is equivalent to 

(49) Pr[CnBi|(^o]>Pr[-DnSi|(5o]. 
But (see Figures 4a, 4b) 

(50) XeB2 => \\Sjs{X;do)-5\\>\\X-6\\, 

where B2 is the ball of radius \\6o — 5\\ centered at 5, 
so DnB2 = 0. Thus, 

(51) CnBi = {Bi\B2)^DnBi, 

hence, (49) holds. (Note that no distributional as- 
sumption on X is needed.) 

APPENDIX D: VERIFICATION OF 
FIGURES 3A AND 3B; VERIFICATION OF (19) 

First we verify that Figures 3a and 3b accurately 
depict the region 

(52) {So\\\6+s{X;So)-S\\<\\X-6\\}. 
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(a) ||5o-(5||<fVP^^ 
Fig. 4 



Illustrating the implication (50). 



a2(p-2)^ + 



Let7 = (l-pf^)+,soO<7<land5+(X;,5o) = 
7(X — 5q) + 5q. Each of the fohowing inequahties is 
equivalent to that in (52): 



< 



7)('5o 



\X-5\\\ 
-5)\X- 



5) 



S) 



\\{l-^){6o-d) + j{X- 

(l-7)'||<5o-<^f + 27(1 
<(l-7')||X-5f, 

{l-j)\\6o-5f+2j{6o-S)\X 
<(l + 7)||X-5f, 

\\6o-6f<j[\\X-6of] + \\X-5f. 

If ||(5o — X|P < (T^(p — 2), that is, Jq hes inside the ball 
of radius o^^/p — 2 (see Figures 3a, 3b), then 7 = 
and the last inequality becomes \\6o — (5|p < \\X — 
(5|p, which holds iff 60 lies inside the ball B of radius 
\\X-6\\ centered at (5. U \\6o- XW^ <a^{p- 2), that 

is. So lies outside this ball, then 7 = (1 — ^^-J iL ) 
and the last inequality instead is equivalent to each 
of the following: 

||<5o - 5f < \\X - <5of + \\X - <5f -a\p- 2), 



cj2(p-2) <2\\X 

a^{p-2)<2{X- 

a\p-2) 



-6\\'' + 2{X-6)\6-5o), 
5)\X-5o), 



2||X-^||<(^-^H^-^°)' 

which holds exactly in the open halfspace K shown 
in Figures 3a and 3b. Thus, the region (52) is the 
union BU K oi the cross-hatched regions in these 
figures. 



Finally, we verify (19), which now can be written 
equivalently as 



(53) 



lim Pr5[(^oG^Ui^]=0, 

p— >oo 



by the following proposition, in which 5, 5q, X, V, a, 
and r now depend on p. 

Proposition 3. Assume the following. ■ 

{[') V = 60 — X is independent of X; 
(ii') V is spherically symmetric; 

(iii') j|y|| = l^,~x\\ = o(//2) in probability as 
p^oo; 

(iv') a~'^\\X-6\\-\\V\\=a~'^\\X-6\\-\\6o-X\\ = 
Q^p3/2-j in probability asp— t- 00. Then [cf. (19)] 



(54) limPr5[||^j+g(X;<5o)-(^||>||X 



1. 



p— >oo 



The boundedness assumption (iii') [resp., (iv')/ is 
satisfied, for example, if X ^ Np{6,a'^lp) and Sq ~ 
Np{X, T^lp) with r/a = o{p^''^) [resp., a/r = oip^'"^)], 
so both are satisfied if t/ct ^p^ with < |e| < 1/2. 

Proof. By the argument that yielded (38) in 
Appendix A [with (i)-(iii) and X, Y , Bi — 5, and 
Po — (5|| replaced by (i')-(iii') and 5o, V, B — X, and 

\\X — S\\], we obtain 

(55) lim Fis[So eB]= lim Frs[V eB-X]=0. 

p— ^00 p— >cx> 

Next, again by the argument in Appendix A but 
with Bi — 6 replaced by K — X, 

(56) Pr^^^oei^lll^-'^ll] 
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(57) 



(58) 



■■E{t,,{\\V\r\K-Xm\X-6\\} 



a\p-2) 



-Pr 

2 



zl 



> 



Zl + '-' + Zj- 4||X 



(^||2||y||2 

\\X-5\\ 



Thus, by (iv') 



(59) 



lim VisldQdK] 

p—^oo 



— lim Pr 

2 p— >oo 



zl 



zl + --- + z^^ 

a\p-2f 



> 



4||X-(^||2||F||2 

so (53) and (54) are confirmed. D 
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